Perfect $L_p$ Sampling in a Data Stream
نویسندگان
چکیده
In this paper, we resolve the one-pass space complexity of perfect $L_p$ sampling for $p \in (0,2)$ in a stream. Given stream updates (insertions and deletions) to coordinates an underl...
منابع مشابه
An Hybrid Data Stream Summarizing Approach by Sampling and Clustering
Computer systems generate a large amount of data that, in terms of space and time, is very expensive even impossible to store. Besides this, many applications need to keep an historical view of such data in order to provide historical aggregated information, perform data mining tasks or detect anomalous behavior in the computer systems. One solution is to treat the data as streams that can be p...
متن کاملPerfect sampling without a lifetime commitment
Generating perfect samples from distributions using Markov chains has a wide range of applications , from statistical physics to approximation algorithms. In perfect sampling algorithms, a sample is drawn exactly from the stationary distribution of a chain, as opposed to methods that run the chain \for a long time" and create samples drawn from a distribution that is close to the stationary dis...
متن کاملContinuous Monitoring of l_p Norms in Data Streams
In insertion-only streaming, one sees a sequence of indices a1, a2, . . . , am ∈ [n]. The stream defines a sequence of m frequency vectors x, . . . , x ∈ R with (x)i def = |{j : j ∈ [t], aj = i}|. That is, x is the frequency vector after seeing the first t items in the stream. Much work in the streaming literature focuses on estimating some function f(x). Many applications though require obtain...
متن کاملDetecting Concept Drift in Data Stream Using Semi-Supervised Classification
Data stream is a sequence of data generated from various information sources at a high speed and high volume. Classifying data streams faces the three challenges of unlimited length, online processing, and concept drift. In related research, to meet the challenge of unlimited stream length, commonly the stream is divided into fixed size windows or gradual forgetting is used. Concept drift refer...
متن کاملAntithetic Coupling for Perfect Sampling
This paper reports some initial investigations of the use of antithetic variates in perfect sampling. A simple random walk example is presented to illustrate the key ingredients of antithetic coupling for perfect sampling as well as its potential benefit. A key step in implementing antithetic coupling is to generate random variates that are negatively associated, a stronger condition than negat...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: SIAM Journal on Computing
سال: 2021
ISSN: ['1095-7111', '0097-5397']
DOI: https://doi.org/10.1137/18m1229912